ARating-RankingMethod for Crowdsourced Top-k Computation
نویسندگان
چکیده
Crowdsourced top-k computation aims to utilize the human ability to identify top-k objects from a given set of objects. Most of existing studies employ a pairwise comparison based method, which first asks workers to compare each pair of objects and then infers the top-k results based on the pairwise comparison results. Obviously, it is quadratic to compare every object pair and these methods involve hugemonetary cost, especially for large datasets. To address this problem, we propose a rating-ranking-based approach, which contains two types of questions to ask the crowd. The first is a rating question, which asks the crowd to give a score for an object. The second is a ranking question, which asks the crowd to rank several (e.g., 3) objects. Rating questions are coarse grained and can roughly get a score for each object, which can be used to prune the objects whose scores are much smaller than those of the topk objects. Ranking questions are fine grained and can be used to refine the scores. We propose a unified model to model the rating and ranking questions, and seamlessly combine them together to compute the top-k results. We also study how to judiciously select appropriate rating or ranking questions and assign them to a coming worker. Experimental results on real datasets show that our method significantly outperforms existing approaches. ACM Reference Format: Kaiyu Li, Xiaohang Zhang, Guoliang Li. 2018. A Rating-RankingMethod for Crowdsourced Top-k Computation. In Proceedings of 2018 International Conference on Management of Data (SIGMOD/PODS ’18). ACM, New York, NY, USA, 16 pages. https://doi.org/10.1145/3183713.3183762
منابع مشابه
Crowdsourced Top-k Algorithms: An Experimental Evaluation
Crowdsourced top-k computation has attracted significant attention recently, thanks to emerging crowdsourcing platforms, e.g., Amazon Mechanical Turk and CrowdFlower. Crowdsourced top-k algorithms ask the crowd to compare the objects and infer the top-k objects based on the crowdsourced comparison results. The crowd may return incorrect answers, but traditional top-k algorithms cannot tolerate ...
متن کاملA Confidence-Aware Top-k Query Processing Toolkit on Crowdsourcing
Ranking techniques have been widely used in ubiquitous applications like recommendation, information retrieval, etc. For ranking computation hostile but human friendly items, crowdsourcing is considered as an emerging technique to process the ranking by human power. However, there is a lack of an easy-to-use toolkit for answering crowdsourced top-k query with minimal effort. In this work, we de...
متن کاملEfficient Techniques for Crowdsourced Top-k Lists
We focus on the problem of obtaining top-k lists of items from larger itemsets, using human workers for doing comparisons among items. An example application is short-listing a large set of college applications using advanced students as workers. We describe novel efficient techniques and explore their tolerance to adversarial behavior and the tradeoffs among different measures of performance (...
متن کاملCrowdsourced Query Processing on Microblogs
Currently, crowdsourced query processing is done on rewarddriven platforms such as Amazon Mechanical Turk (AMT) and CrowdFlower. However, due to budget constraints for conducting a crowdsourcing task in practice, the scalability is inherently poor. In this paper, we exploit microblogs for supporting crowdsourced query processing. We leverage the social computation power and decentralize the eva...
متن کاملA Crowdsourced Encoding Approach for Handwritten Sheet Music
We present an approach that uses human computation and crowdsourcing principles for encoding large amounts of monophonic, handwritten sheet music.
متن کامل